Processing image and audio information for recognising discourse participation status through features of face and voice
نویسندگان
چکیده
This paper describes a system based on a 360-degree camera with a single microphone that detects speech activity in a roundtable context for the purpose of estimating discourse participation status information for each member present. We have obtained 97% accuracy in detecting participants and have shown that the use of non-verbal and backchannel speech information is a useful indicator of participant status in a discourse.
منابع مشابه
The Combinational Use Of Knowledge-Based Methods and Morphological Image Processing in Color Image Face Detection
The human facial recognition is the base for all facial processing systems. In this work a basicmethod is presented for the reduction of detection time in fixed image with different color levels.The proposed method is the simplest approach in face spatial localization, since it doesn’trequire the dynamics of images and information of the color of skin in image background. Inaddition, to do face...
متن کاملVoice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus
The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...
متن کاملVoice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus
The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملResearch of Video Retrieval based on Image and Audio Feature
Recently as computer technology and multimedia information have developed, text information as well as various types of image information can easy to obtain and be stored. In this paper, we proposed the efficient method of context based image retrieval to extract the face image and audio features in the video sequence. The eigenvectors for face can be obtained by the principal component analysi...
متن کامل